|
|
Accession Number |
TCMCG018C02069 |
gbkey |
CDS |
Protein Id |
XP_004136154.1 |
Location |
complement(join(18939143..18939393,18940074..18940203,18940417..18940516,18940622..18940701,18941071..18941208,18941292..18941578,18941927..18942089,18942705..18942806,18942904..18943106,18943187..18943365,18943453..18943544,18943683..18943812,18943903..18944453,18945042..18945233,18945407..18945490,18945617..18945685,18945804..18945878,18945964..18946017,18946116..18946218,18946424..18946517,18946899..18947745)) |
Gene |
LOC101219303 |
GeneID |
101219303 |
Organism |
Cucumis sativus |
|
|
Length |
1307aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA182750 |
db_source |
XM_004136106.3
|
Definition |
DNA mismatch repair protein MSH6 [Cucumis sativus] |
CDS: ATGTCATCATCTCGTCGATCCAGCAATGGCCGATCGCCATTAGTTAACCAACAACGTCAAATCACTTCCTTCTTTACCAAAAAACCCACCGGAGACAACTCCGCTGCTAGAACTCATTCCATTTCCTCCCCGACTCCAAGCCCTAGCCCTAACATTAATTCTCCTCCGTCAGTACAGTCCAAGCGCAAGAAACCCCTGTTGGTTATCGGCGGCGGTGCTCCGCCTTTTTCTTCTTCTTCTCCCGGTTCTTCTTCTTTACCTGATGCGGAGGAGAAATCGCACGGTGATGGGGTTATTGGGAAGAAGATAAAGGTTTATTGGCCGTTGGATAAGACCTGGTACGAGGGTCGTGTGAAAATGTTTGATGAGAAGGCTGGAAAGCATTTGGTGCAGTATGACGATGCGGAGGAGGAGTTGTTGGTGTTGGGGAACGAAAAGATTGAGTGGGTTGAGGAAAGTGCGAAGAAGTTTAAGCGATTGCGACGAGGTTCTTCACCGCCGGTGAGTGCTGCAGTGCTGGAAGATATGGATGATTTAAATGATTTAAGTGACGGGGATGGCAGCGATGACTCTAGAGATGAAGATTGGGGGAAGAATGTGGAGAACGAGGTGAGTGAGGAGGAGGATGTGGATCTGGTGGAGGAAAATGAAGACGAAGATGGGAGCGAGGAGGATGGAGTGGGAAAGTCAAGAAGGAAGCAGGGTGGGCAGGTGGAGTCTAAAAAGCGTAAGATGAGTAATGGAAAGAAAGTTGAAGTTGCTCCAAAGAAGATCAAGAGTAGCGGTGGAAGTGTGACTTCTGGAGGGTTACAACTTTCTTCAATGGAGACTAAGATTAAATCAGAGAGTACAAGTGTATTAAAGGGAATAAATGAAATTGCAAGTGATGCCTTAGAAAGGTTCAACTCACGAGAAGCCGAAAAGTTCAGGTTTCTGAAAGAAGATAGGAAGGATGCAAATAAAAGATGTCCAGGAGATCCCGATTATGACCCAAAAACTTTGCATTTGCCTCCATATTTTGTGAAGAATTTATCAGATGGCCAGAGACAATGGTGGGAGTTTAAGTCAAAACACATGGATAAAGTTCTGTTTTTCAAGATGGGTAAATTTTATGAACTTTTTGAAATGGATGCACACATAGGAGCTAAAGAACTTGATTTGCAATATATGAAGGGAGATCAACCTCATTGTGGCTTCCCCGAGAGAAACTTTTCACTCAATGTGGAGAAGTTGGCAAGGAAGGGTTATCGAGTTCTTGTCATAGAGCAGACAGAAACTCCTGAACAATTAGAGAGACGGCGTAAGGAGAAAGGTTCTAAGGACAAGGTAGTGAAACGCGAAATATGTGCCGTAGTCACAAAAGGAACACTAACCGAGGGTGAGATGCTATCTTTGAATCCTGATGCTTCATATCTGATGGCAGTAACCGAAAATTTTTATGGGTTGGAAAATCAACAAGAACGGATTTTAGGGGTTTGTGTGGTTGATGTGGCTACCAGTAGGGTTATCCTTGGGCAGTTTGGAGATGACTCGGAGTGCAGTGCCTTGTGCTGTCTTTTGTCCGAGCTTAGACCAGTTGAAATTATTAAACCAGCTAAACTGCTAAGTCCTGAAACTGAGAGGGTGCTGCTCACTCATACGAGAAATCCTTTAGTGAATGAGTTAGTTCCATTATTGGAATTCTGGGATGCTGAGAAAACTGTTCAAGAAGTTAAGAGGTTGTTTAAGGGCATTGCTAATAGATCGGTTTCTGGATCTTCAAGTGAAGCAAGTTTACTCAATGACAATGCTGCCAGAGAAAACGATGGGTTGAGCTACATGCCAGATGTTTTATCCGAACTGGTTACTGCAGACGAAAATGGGTCTTGGGCACTTTCAGCTCTTGGAGGCATTCTATTCTATCTGAAGCAAGCTTTTCTGGATGAGACATTGCTTAGATTTGCAAAGTTTGAATTACTTCCTTGTTCTGGCTTCAGTGATGTTATTTCAAAACCCTATATGGTTCTTGATGCAGCTGCCTTAGAAAATCTAGAGATCTTTGAGAACAGCAGAAACGGGGATTCTTCTGGGACGCTCTATTCACAGTTGAACCACTGTGTAACTGCATTTGGGAAAAGATTACTTAAGACATGGCTTGCAAGGCCTTTATATCACGTTGAATCAATTGAAGCTAGGCAAGGTGCTGTGGCGAGCCTACGGGGAGATAACTTATCTTTTTCTCTTGAGTTTCGAAAAGCATTATCCAAACTTCCTGATATGGAGCGTCTACTTGCTCGCATTTTTTCTAATAGTGAGGCAAATGGGAGGAATGCTATCAATGTGGTTCTATATGAGGATGCAGCCAAAAAACAACTACAAGAGTTCATATCTGCTTTGCGGGGTTGTGAGCTCATGCTCCAAGCTTGTTCGTCGCTCCGTGTCATTTTGCCAAATGTTAAATCAAGAAGACTCGATTGCCTATTAACGCCAGGTGAAGGTCTTCCAGATCTTCATTCAGTTCTAAGTCATTTCAAGGATGCTTTTGATTGGGTTGAAGCCAATAGTTCAGGACGTGTAATACCTCGCGAAGGTGTAGACGTGGAGTATGACTCTGCCTGTGAGAAAATTAGGGAGATACAATCTAGCTTGACAAAGCATCTAAAGGAACAGCGGAAATTACTTGGGGACACATCTATCACTTATGTGACAGTTGGAAAAGAGACACATTTGTTGGAAGTGCCTGAAAGTTTGCAGGGTAACATTCCTCAGACTTATGAGTTGCGATCATCTAAAAAGGGCTTCTTTCGGTACTGGACTCCTAATATTAAGAAGTTGTTAGCGGAGCTTTCTCTAGCTGAATCTGAGAAGGAGTCCTCACTGAAAAGCATTTTGCAAAGGTTAATCAGAAAATTCTGTGAACATCATCTCCAATGGAGACAATTAGTCTCTGCAATTGCTGAACTTGATGTTTTGATTAGCTTAGCAATTGCAAGTGATTATTATGAGGGTTACACATGCCAACCACTTTTCTCGAAGTCACAGTGTCAGAATGAAGTGCCACGTTTTACTGCTAAAAACTTAGGACATCCCATTCTAAGAAGTGATTCACTGGGTGAGGGTACATTTGTCCCCAATGACATTACTATTGGTGGCTCAGGAGCCAACTTCATTCTTCTGACTGGGCCTAACATGGGTGGAAAGTCTACTCTTCTTCGGCAAGTTTGCTTGTCTGTTATTCTGGCTCAGATAGGTGCAGATGTTCCTGCAGAAAGTTTTGAGTTAGCTCCTGTTGATCGAATTTTTGTACGGATGGGTGCTAGGGATCAGATTATGTCTGGCCAAAGTACATTTTTGACAGAACTATCAGAAACTGCACTGATGCTGTCATCAGCTACCCGTAATTCAGTGGTGATCTTGGATGAACTTGGACGTGGTACGGCAACTTCAGATGGACAGGCAATTGCGGAATCAGTTCTTGAACATTTTGTTAGCAAGGTGCAGTGCAGGGGAGTATTCTCAACTCATTATCACCGATTGGCCTTGGCTTATCATAAAGATCCTAGGGTTTCATTACACCATATGGCATGTCGAGTTGGAGAGGGAAACAACGGTTTAGAAGAAGTTACATTTCTCTATCGTCTAACTCCTGGCACATGCCCTAAAAGTTATGGCGTGAATGTTGCACGGTTAGCTGGACTCCCAAATTGTGTCTTGACCGAGGCTGCGGCTAAATCAATGGAATTTGAGGTTACATATGGCATGGCTGGAGAAGAATCTGAAGTTGACTTGTGCAATCAAACTTGGGTAGATGATACAACAACTTTGATTCAAAAGTTGATAAGCCTGGAATCAGCTGTGAGATGCAATGATGAAACTGAGAAGAATGGTATCGGTTCCTTGAAACAGCTTCAACAACAAGCAAGAATACTTGTGCAGCAAGGTTGA |
Protein: MSSSRRSSNGRSPLVNQQRQITSFFTKKPTGDNSAARTHSISSPTPSPSPNINSPPSVQSKRKKPLLVIGGGAPPFSSSSPGSSSLPDAEEKSHGDGVIGKKIKVYWPLDKTWYEGRVKMFDEKAGKHLVQYDDAEEELLVLGNEKIEWVEESAKKFKRLRRGSSPPVSAAVLEDMDDLNDLSDGDGSDDSRDEDWGKNVENEVSEEEDVDLVEENEDEDGSEEDGVGKSRRKQGGQVESKKRKMSNGKKVEVAPKKIKSSGGSVTSGGLQLSSMETKIKSESTSVLKGINEIASDALERFNSREAEKFRFLKEDRKDANKRCPGDPDYDPKTLHLPPYFVKNLSDGQRQWWEFKSKHMDKVLFFKMGKFYELFEMDAHIGAKELDLQYMKGDQPHCGFPERNFSLNVEKLARKGYRVLVIEQTETPEQLERRRKEKGSKDKVVKREICAVVTKGTLTEGEMLSLNPDASYLMAVTENFYGLENQQERILGVCVVDVATSRVILGQFGDDSECSALCCLLSELRPVEIIKPAKLLSPETERVLLTHTRNPLVNELVPLLEFWDAEKTVQEVKRLFKGIANRSVSGSSSEASLLNDNAARENDGLSYMPDVLSELVTADENGSWALSALGGILFYLKQAFLDETLLRFAKFELLPCSGFSDVISKPYMVLDAAALENLEIFENSRNGDSSGTLYSQLNHCVTAFGKRLLKTWLARPLYHVESIEARQGAVASLRGDNLSFSLEFRKALSKLPDMERLLARIFSNSEANGRNAINVVLYEDAAKKQLQEFISALRGCELMLQACSSLRVILPNVKSRRLDCLLTPGEGLPDLHSVLSHFKDAFDWVEANSSGRVIPREGVDVEYDSACEKIREIQSSLTKHLKEQRKLLGDTSITYVTVGKETHLLEVPESLQGNIPQTYELRSSKKGFFRYWTPNIKKLLAELSLAESEKESSLKSILQRLIRKFCEHHLQWRQLVSAIAELDVLISLAIASDYYEGYTCQPLFSKSQCQNEVPRFTAKNLGHPILRSDSLGEGTFVPNDITIGGSGANFILLTGPNMGGKSTLLRQVCLSVILAQIGADVPAESFELAPVDRIFVRMGARDQIMSGQSTFLTELSETALMLSSATRNSVVILDELGRGTATSDGQAIAESVLEHFVSKVQCRGVFSTHYHRLALAYHKDPRVSLHHMACRVGEGNNGLEEVTFLYRLTPGTCPKSYGVNVARLAGLPNCVLTEAAAKSMEFEVTYGMAGEESEVDLCNQTWVDDTTTLIQKLISLESAVRCNDETEKNGIGSLKQLQQQARILVQQG |